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SEQUENCE LISTING 

<110> CNRS 

<120> STARCH GRANULES CONTAINING A RECOMBINANT POLYPEPTIDE 

OF INTEREST, A METHOD OF OBTAINING THEM AND THEIR USES 

<130> WOB 99 AB CNR AMYL 

<140> US 09/980,771 
<141> 2001-11-15 

<150> FR 99/06494 
<151> 1999-05-21' 

<160> 11 

<170> Patentin version 3.1 

<210> 1 
<211> 3117 

<212> DNA ) 
<213> Chlamydomonas reinhardtii 

<400> 1 

gccaggcgga gcgtatggct gttgcctcta ccagccgccc cagcagcgcg cgtcctatcg 60 

tcatcaatgc cgcgtcgttc ggtgtcaaga agaccgcgaa ccagctgctg cgtgagcttg 120 

ctcgtggctc cgcacgcaag tccacctcgc gctcggctgt tactggtgcc actggtgcca 180 

cttgcgcgct ggacatcgtg atggttgctg ctgaggtcgc cccttggtcc aagacgggcg 240 

gcctgggcga tgtgactggt ggcctgccta ttgagctggt caagcgcggc caccgcgtca 300 

tgaccattgc ccctcgctac gaccagtacg ctgacgcctg ggacacctcg gtggtcgtgg 360 

acatcatggg cgagaaggtc cgctacttcc actccatcaa gaagggcgtg caccgcgtgt 420 

ggattgacca cccctggttc ctggccaagg tctggggcaa gaccggctcc aagctgtacg 480 

gcccccgctc cggcgctgac tacctggaca accacaagcg cttcgccctg ttctgcaagg 540 

ccgctattga ggctgcccgc gtgctgccct tcggccccgg cgaggactgc gtcttcgtgg 600 

ccaacgactg gcactccgcc ctggtgcccg tcctgctgaa ggacgagtac cagcccaagg 660 

gccagttcac caaggccaag tcggtgctgg ctatccacaa catcgccttc cagggccgca 720 

tgtgggagga ggctttcaag gacacgaagc tgccccaggc cgcctttgac aagctggcct 780 

tctcggacgg ctatgccaag gtttacactg aggccacccc catggaggag gacgagaagc 84 0 

ccccgctgac gggaaagacc tacaagaaga tcaactggct gaagggtggc attatcgccg 900 

ccgacaagct ggtgactgtg tcgcccaact acgcgaccga gatcgctgcc gatgccgccg 960 

gcggtgtgga gctggacacc gtcatccgcg ccaagggcat tgagggcatt gtgaacggca 1020 

tggacattga ggagtggaac cccaagaccg acaagttcct gtctgcgccc tacgaccaga 1080 

acagcgtcta cgccggcaag gccgccgcca aggaggccct gcaggccgag ctgggcctgc 114 0 

ctgtggaccc caccgccccc ctgttcgcct tcatcggccg cctggaggag cagaagggtg 1200 

tggacatcat cctggccgcc ctgcccaaga tcctggccac ccccaaggtg cagatcgcca 1260 

tcctgggtac cggcaaggcc gcctacgaga agctggtgaa cgccatcggc accaagtaca 1320 

agggccgcgc caagggcgtg gtcaagttct cggcgcccct ggcgcacatg ctcaccgccg 1380 

gcgccgactt catgctggtg ccctcgcgct tcgagccctg cggcctgatc cagctgcacg 14 4 0 

ccatgcacta cggtaccgtg cccgtggtag cctccaccgg cggcctggtc gacaccgtca 1500 

aggagggcgt caccggcttc cacatgggcg ccctgaaccc cgacaagctg gacgaggctg 1560 

acgccgacgc cctggccgcc accgtgcgcc gtgccagcga ggtgtttgcg ggcggccgct 1620 

accccgagat ggtggccaac tgcatcagcc aggacctgtc ctggtccaag cccgcccaga 1680 

agtgggaggg cctgctggag gaggtggtgt acggcaaggg cggcgtggcc accgccaaga 1740 

aggaggagat caaggtgccc gttgccgaga agatccccgg cgacctgccc gccgtgtcct 1800 

acgcccccaa caccctgaag cccgtgtccg cctccgtgga gggcaacggc gccgccgcgc 18 60 

ccaaggtcgg caccaccgcc cccgccatgg gcgcgtggcg cgcgaccacc ccctcgggcc 1920 

cctcgcccgc cgccgccacc cccaaggtga ccacctacaa gcccgccctg cccgccaccg 1980 

ccaagcccaa gaccgctggc ctcaagctgg ccggtgaggc ctccaccacc tcgacctcgg 2040 

agaacggcgc tgcctccaac ggcaacggca acggtgcctc ggcctccaag acctcggctg 2100 

ccaagcccct ggtctccgcc gccacccgca agtccgccta aagcggcagt agccgcagag 2160 
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gc^gcgacag 
gcggccgcca 
gagcatggca 
tgaagtcggt 
gtttcggttt 
tgcgcggcgc 
tcggcggcaa 
gagagtttgc 
cctccggcaa 
ggatacataa 
ggttttgttg 
cttgggctca 
ggctcgaagc 
cttgactgga 
acccagtagt 
tgagagagta 



catgagcggc 
ccggcgagga 
ggcggaaacg 
tttgcgagac 
tgcccaagtc 
ggcgaccacg 
gagggcgggt 
cattttgtgg 
cggagaagtg 
gagcgtgtgg 
aggtgcagac 
tttttggtgc 
gagggtttgt 
ggtacacaaa 
gtggaaagta 
acaaagatgg 



tcgaccaaag 
gcaggcttgc 
tgtgtactga 
cggagaaacg 
caaaagaaga 
ccggcccgtg 
ctgtacactc 
gacgtgagcg 
ggaggcgctg 
aatggtggta 
ggcaccgtcg 
ccgtcgatga 
ggaagtggag 
gtggagcgtg 
gttggacctg 
cacctgaaaa 



ctgtggcagg 
ggcagcgagg 
aatgtggtgc 
ccggtttggt 
gtaacgaaac 
cgcagcctgt 
catccattcc 
gcgggacggc 
tagcccggtg 
aaagaggagg 
gcgtcaaagg 
tgagagattg 
cgaggagggt 
cgacggcacg 
aattctttga 
aaaaaaaaaa 



aacggctgta 
gcgatgagct 
atgagagtgt 
tttgtagtgc 
tgtagcagta 
cctgccctca 
aggatttttg 
cgcgccgggc 
accccccaat 
ggcctgggtc 
ccctcgcaag 
gccagcggtt 
tggagaaaga 
gaggcattgg 
gagtaccgcg 
aaaaaaaaaa 



gcagcggcag 
tagcggccgt 
cgtgctgtaa 
agggcctgtg 
gcagagcact 
gccttgtgat 
caggctgcct 
tctcctaccg 
gtagaggatg 
gcccctcgat 
gcccgggtgc 
ttttgaggct 
ggcggacatg 
cggactattg 
cattaatccg 
aaaaaaa 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3117 



<210> 2 

<211> 2124 

<212> DNA 

<213> artificial seguence 
<220> 

<223> fragment of the complete sequence of cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> {1)..(2124) 

<223> 

<400> 2 

atg get gtt gcc tct acc age cgc ccc age age geg cgt cct ate gtc 4 8 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 
15 10 15 

ate aat gcc gcg teg ttc ggt gte aag aag acc geg aac cag ctg etg 96 
lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 

cgt gag ctt get egt ggc tee gea ege aag tec acc teg cgc teg get 14 4 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 

gtt act ggt gcc act ggt gcc act tgc gcg ctg gac ate gtg atg gtt 192 
Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp He Val Met Val 
50 55 60 

get get gag gte gcc cct tgg tec aag acg ggc ggc ctg gge gat gtg 240 
Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 

act ggt ggc ctg cct att gag ctg gtc aag cgc ggc eac cgc gtc atg 288 
Thr Gly Gly Leu Pro He Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 

acc att gcc cct cgc tac gac cag tac get gac gee tgg gac acc teg 336 
Thr He Ala Pro Arg Tyr Asp Gin Tyr Ala Asp Ala Trp Asp Thr Ser 
100 105 110 
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teg gac ggc tat gcc aag gtt tac act gag gcc acc ccc atg gag gag 
Ser Asp Gly Tyr Ala Lys Val Tyr Thr Glu Ala Thr Pro Met Glu Glu 
260 265 270 



ctg aag ggt ggc att ate gcc gcc gac aag ctg gtg act gtg teg ccc 
Leu Lys Gly Gly lie He Ala Ala Asp Lys Leu Val Thr Val Ser Pro 
290 295 300 



gac acc gtc ate cgc gcc aag ggc att gag ggc att gtg aac ggc atg 
Asp Thr Val He Arg Ala Lys Gly He Glu Gly He Val Asn Gly Met 
325 330 335 



432 



480 



gtg gtc gtg gac ate atg ggc gag aag gtc cgc tac ttc cac tec ate 384 
Val Val Val Asp He Met Gly Glu Lys Val Arg Tyr Phe His Ser He 
115 120 125 

aag aag ggc gtg cac cgc gtg tgg att gac cac ccc tgg ttc ctg gcc 
Lys Lys Gly Val His Arg Val Trp He Asp His Pro Trp Phe Leu Ala 
130 135 140 

aag gtc tgg ggc aag ace ggc tee aag ctg tac ggc ccc cgc tec ggc 
Lys Val Trp Gly Lys Thr Gly Ser Lys Leu Tyr Gly Pro Arg Ser Gly 
145 150 155 160 

get gac tac ctg gac aac cac aag cgc ttc gcc ctg ttc tgc aag gcc 528 
Ala Asp Tyr Leu Asp Asn His Lys Arg Phe Ala Leu Phe Cys Lys Ala 
165 170 175 

get att gag get gcc cgc gtg ctg ccc ttc ggc ccc ggc gag gac tgc 57 6 

Ala He Glu Ala Ala Arg Val Leu Pro Phe Gly Pro Gly Glu Asp Cys 
180 185 190 

gtc ttc gtg gcc aac gac tgg cac tec gee ctg gtg ccc gtc ctg ctg 624 
Val Phe Val Ala Asn Asp Trp His Ser Ala Leu Val Pro Val Leu Leu 
195 200 205 

aag gac gag tac cag ccc aag ggc cag ttc acc aag gcc aag teg gtg 672 
Lys Asp Glu Tyr Gin Pro Lys Gly Gin Phe Thr Lys Ala Lys Ser Val 
210 215 220 

ctg get ate cac aac ate gcc ttc cag ggc cgc atg tgg gag gag get 720 
Leu Ala He His Asn He Ala Phe Gin Gly Arg Met Trp Glu Glu Ala 
225 230 235 240 

ttc aag gac acg aag ctg ccc cag gee gcc ttt gac aag ctg gee ttc 768 
Phe Lys Asp Thr Lys Leu Pro Gin Ala Ala Phe Asp Lys Leu Ala Phe 
245 250 255 



816 



gac gag aag ccc ecg ctg acg gga aag acc tac aag aag ate aac tgg 864 
Asp Glu Lys Pro Pro Leu Thr Gly Lys Thr Tyr Lys Lys He Asn Trp 
275 280 285 



912 



aac tac gcg acc gag ate get gee gat gcc gcc ggc ggt gtg gag ctg 960 
Asn Tyr Ala Thr Glu He Ala Ala Asp Ala Ala Gly Gly Val Glu Leu 
305 310 315 320 



1008 



gac att gag gag tgg aac ccc aag acc gac aag ttc ctg tet gcg ccc 
Asp He Glu Glu Trp Asn Pro Lys Thr Asp Lys Phe Leu Ser Ala Pro 
340 345 350 



1056 
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tac gac cag aac age gtc tac gcc ggc aag gcc gcc gcc aag gag gcc 1104 
Tyr Asp Gin Asn Ser Val Tyr Ala Gly Lys Ala Ala Ala Lys Glu Ala 
355 360 365 

ctg cag gcc gag ctg ggc ctg cct gtg gac ccc acc gcc ccc ctg ttc 1152 
Leu Gin Ala Glu Leu Gly Leu Pro Val Asp Pro Thr Ala Pro Leu Phe 
370 375 380 

gcc ttc ate ggc cgc ctg gag gag cag aag ggt gtg gac ate ate ctg 1200 
Ala Phe He Gly Arg Leu Glu Glu Gin Lys Gly Val Asp He lie Leu 
385 390 395 400 

gcc gcc ctg ccc aag ate ctg gcc acc ccc aag gtg cag ate gcc ate 1248 
Ala Ala Leu Pro Lys He Leu Ala Thr Pro Lys Val Gin He Ala He 
405 410 415 

ctg ggt ace ggc aag gcc gcc tac gag aag ctg gtg aac gee ate ggc 12 96 

Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala He Gly 
420 425 430 

acc aag tac aag ggc cgc gcc aag ggc gtg gtc aag ttc teg gcg ccc 1344 
Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

ctg gcg cac atg etc acc gcc ggc gcc gac ttc atg ctg gtg ccc teg 1392 
Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 



cgc ttc gag ccc tgc ggc ctg ate cag ctg cac gcc atg cac tac ggt 
Arg Phe Glu Pro Cys Gly Leu He Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 



1440 



acc gtg ccc gtg.gta gcc tee acc ggc ggc ctg gtc gac ace gtc aag 1488 
Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

gag ggc gtc acc ggc ttc cac atg ggc gcc ctg aac ccc gac aag ctg 1536 
Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

gac gag get gac gcc gac gcc ctg gcc gcc acc gtg cgc cgt gcc age 1584 
Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

gag gtg ttt gcg ggc ggc cgc tac ccc gag atg gtg gcc aac tgc ate 1632 
Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys He 
530 535 540 

age cag gac ctg tec tgg tee aag ccc gcc cag aag tgg gag ggc ctg 1680 
Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

ctg gag gag gtg gtg tac ggc aag ggc ggc gtg gcc acc gee aag aag 1728 
Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 



gag gag ate aag gtg ccc gtt gcc gag aag ate ccc ggc gac ctg ccc 
Glu Glu He Lys Val Pro Val Ala Glu Lys He Pro Gly Asp Leu Pro 
580 585 590 



1776 
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gcc gtg tec tac gcc ccc aac acc ctg aag ccc gtg tec gcc tec gtg 1824 
Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 
595 600 605 

gag ggc aac ggc gcc gcc gcg ccc aag gtc ggc acc acc gcc ccc gcc 1872 
Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 
610 615 620 

atg ggc gcg tgg cgc gcg acc acc ccc teg ggc ccc teg ccc gcc gcc 1920 
Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 640 

gcc acc ccc aag gtg acc ace tac aag ccc gcc ctg ccc gee acc gcc 1968 
Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 655 

aag ccc aag ace get ggc etc aag ctg gcc ggt gag gee tec acc ace 2016 
Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

teg ace teg gag aac ggc get gcc tec aac ggc aac ggc aac ggt gcc 2064 
Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 

teg gcc tee aag acc teg get gcc aag ccc ctg gtc tec gee gcc acc 2112 
Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 

cgc aag tec gcc 2124 

Arg Lys Ser Ala 

705 



<210> 3 

<211> 708 

<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete sequence of cDNA coding 
for the GBSSl of Chlamydomonas reinhardtii 

<400> 3 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 
1 5 10 15 

He Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 ' 40 45 

Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp He Val Met Val 
50 55 60 

Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



Thr Gly Gly Leu Pro He Glu . Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 
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Thr lie Ala Pro 
100 

Val Val Val Asp 
115 

Lys Lys Gly Val 
130 

Lys Val Trp Gly 
145 

Ala Asp Tyr Leu 



Ala He Glu Ala 
180 

Val Phe Val Ala 
195 

Lys Asp Glu Tyr 
210 

Leu Ala He His 
225 

Phe Lys Asp Thr 



Ser Asp Gly Tyr 
260 

Asp Glu Lys Pro 
275 

Leu Lys Gly Gly 
290 

Asn Tyr Ala Thr 

305 

Asp Thr Val He 



Asp He Glu Glu 
340 

Tyr Asp Gin Asn 

355 

Leu Gin Ala Glu 

370 

Ala Phe He Gly 
385 

Ala Ala Leu Pro 



Arg Tyr Asp Gin 



He Met Gly Glu 
120 

His Arg Val Trp 

135 

Lys Thr Gly Ser 
150 

Asp Asn His Lys 
165 

Ala Arg Val Leu 



Asn Asp Trp His 
200 

Gin Pro Lys Gly 
215 

Asn He Ala Phe 
230 

Lys Leu Pro Gin 
245 

Ala Lys Val Tyr 



Pro Leu Thr Gly 
280 

He He Ala Ala 
295 

Glu He Ala Ala 
310 

Arg Ala Lys Gly 
325 

Trp Asn Pro Lys 



Ser Val Tyr Ala 
360 

Leu Gly Leu Pro 

375 

Arg Leu Glu Glu 
390 

Lys He Leu Ala 
405 



Tyr Ala Asp Ala 
105 

Lys Val Arg Tyr 



He Asp His Pro 
140 

Lys Leu Tyr Gly 
155 

Arg Phe Ala Leu 
170 

Pro Phe Gly Pro 
185 

Ser Ala Leu Val 



Gin Phe Thr Lys 
220 

Gin Gly Arg Met 
235 

Ala Ala Phe Asp 
250 

Thr Glu Ala Thr 
265 

Lys Thr Tyr Lys 



Asp Lys Leu Val 
300 

Asp Ala Ala Gly 
315 

He Glu Gly He 
330 

Thr Asp Lys Phe 
345 

Gly Lys Ala Ala 



Val Asp Pro Thr 
380 

Gin Lys Gly Val 

395 

Thr Pro Lys Val 
410 



Trp Asp Thr Ser 
110 

Phe His Ser He 
125 

Trp Phe Leu Ala 



Pro Arg Ser Gly 
160 

Phe Cys Lys Ala 
175 

Gly Glu Asp Cys 
190 

Pro Val Leu Leu 
205 

Ala Lys Ser Val 



Trp Glu Glu Ala 
240 

Lys Leu Ala Phe 
255 

Pro Met Glu Glu 
270 

Lys He Asn Trp 
285 

Thr Val Ser Pro 



Gly Val Glu Leu 

320 

Val Asn Gly Met 

335 

Leu Ser Ala Pro 
350 

Ala Lys Glu Ala 
365 

Ala Pro Leu Phe 



Asp He He Leu 
400 

Gin He Ala He 
415 
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L6u Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala lie Gly 
420 425 430 

Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

Arg Phe Glu Pro Cys Gly Leu He Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys lie 
530^ 535 540 

Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

Glu Glu He Lys Val Pro Val Ala Glu Lys He Pro Gly Asp Leu Pro 
580 585 590 

Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 
595 600 605 

Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 
610 615 620 

Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 ' 640 

Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 655 

Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 

Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 



Arg Lys Ser Ala 
705 
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<210> 4 
<211> 1953 
<212> DNA 

<213> artificial sequence 
<220> 

<223> fragment of the complete sequence of cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii and 
coding for the mature GBSSI protein 

<220> 

<221> CDS 

<222> (1)..{1953) 

<223> 

<400> 4 

gcg ctg gac ate gtg atg gtt get get gag gtc gcc cct tgg tec aag 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg ace att gee cct cge tac gac cag tac 
Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 



48 



96 



144 



get gac gcc tgg gac ace teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

gtc cge tac tte cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

gac cac ccc tgg ttc ctg gcc aag gtc tgg ggc aag ace ggc tee aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gcc ctg ttc tgc aag gcc get att gag get gee cgc gtg ctg ccc 
Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 



384 



ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gee ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 



ttc acc aag gee aag teg gtg ctg get ate cac aac ate gee ttc cag 
Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 



528 
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1344 



1392 



1440 



1488 



ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 
Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gcc gac gcc ctg gcc 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 
Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gee aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu He Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc ggc gac ctg ccc gcc gtg tec tac gee ccc aac ace ctg 
Lys He Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 
530 535 540 

aag ccc gtg tec gee tec gtg gag ggc aac ggc gcc gee gcg ccc aag 
Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 
545 550 555 560 



1632 



1680 



gtc ggc ace acc gcc ccc gcc atg ggc gcg tgg cgc gcg ace acc ccc 1728 
Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 
565 570 575 

teg ggc ccc teg ccc gcc gcc gcc acc ccc aag gtg acc acc tac aag 1776 
Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 
580 585 590 

ccc gee ctg ccc gcc ace gcc aag ccc aag acc get ggc etc aag ctg 
Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

gcc ggt gag gee tec ace ace teg acc teg gag aac ggc get gcc tec 
Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

aac ggc aac ggc aac ggt gcc teg gcc tec aag acc teg get gee aag 
Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 

ccc ctg gtc tee gcc gee ace cgc aag tee gcc 
Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



1824 



1872 



1920 



1953 
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<210> 5 
<211> 651 
<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete sequence of cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii and 
coding for the mature GBSSI protein 

<400> 5 

Ala Leu Asp He Val, Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 

20 ' 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 ' 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 V 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 
245 250 255 



Ala Ala Gly Gly Val Glu Leu Asp Thr 
260 265 



Val He Arg Ala Lys 
270 



Gly He 
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■Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala- Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 

Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 - 510 

Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

Lys lie Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 
530 535 540 

Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 
545 550 555 560 

Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 

565 570 575 

Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 
580 585 590 
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Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 

Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



<210> 6 

<211> 1314 

<212> DNA 

<213> artificial sequence 
<220> 

<223> fragment of the complete cDNA coding for 
the GBSSl of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> {1)..(1314) 

<223> 

<400> 6 

gcg ctg gac ate gtg atg gtt get get gag gtc gee cct tgg tec aag 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gte 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 ' 30 

aag cgc ggc cac cgc gtc atg ace att gee cet cgc tac gac eag tac 
Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gcc tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 ' 55 60 

gte cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 
Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gcc aag gte tgg ggc aag acc ggc tec aag 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gcc ctg ttc tgc aag gcc get att gag get gcc cgc gtg ctg ccc 
Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 



48 



96 



144 



192 



240 



288 



336 



384 
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ttc ggc ccc ggc gag gac tgc gtc ttc gtg gcc aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gcc ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gcc aag teg gtg ctg get ate eac aac ate gcc ttc cag 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gcc 57 6 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 



gcc ttt gac aag ctg gee ttc teg gac ggc tat gcc aag gtt tac act 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gee ace ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 



225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg ace gag ate get gcc gat 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 

245 250 255 

gcc gee ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 

Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 



480 



528 



624 



672 



ace tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gee gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 



768 



816 



864 



gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gee aag gag gcc ctg cag gcc gag ctg ggc ctg cet gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc ace gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gee ace 1056 
Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 350 



ccc aag gtg cag ate gee ate ctg ggt acc ggc aag gcc gcc tac gag 
Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 



1104 
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aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gcc aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ecc etg gcg cac atg etc acc gcc gge gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag cce tgc ggc etg ate cag 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 

etg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc 
Gly Leu Val Asp Thr Val 
435 



1248 



1296 



<210> 7 

<211> 438 

<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete cDNA coding 

for the GBSSI of Chlamydomonas reinhardtii 

<400> 7 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 



Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 
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Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn 
165 170 



He Ala Phe Gin 
175 



Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys 
180 185 



Leu Pro Gin Ala 
190 



Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala 
195 200 



Lys Val Tyr Thr 

205 



Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 



Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly lie 
225 230 235 



He Ala Ala Asp 
240 



Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu 
245 250 



He Ala Ala Asp 
255 



Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg 
260 265 



Ala Lys Gly He 
270 



Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp 
275 280 



Asn Pro Lys Thr 
285 



Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 



Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu 
305 310 315 



Gly Leu Pro Val 
320 



Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg 
325 330 



Leu Glu Glu Gin 
335 



Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys 
340 345 



He Leu Ala Thr 
350 



Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys 
355 360 



Ala Ala Tyr Glu 
365 



Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 



Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu 
385 390 395 



Thr Ala Gly Ala 
400 



Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys 
405 410 



Gly Leu He Gin 
415 



Leu His Ala Met His Tyr Gly Thr Val Pro Val Val 
420 425 



Ala Ser Thr Gly 
430 



Gly Leu Val Asp Thr Val 
435 



<210> 8 

<211> 1593 

<212> DNA 

<213> artificial sequence 
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<220> 

<223> fragment of the complete cDNA coding for 
the GBSSI of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1)..{1593) 

<223> 

<400> 8 

gcg ctg gac ate gtg atg gtt get get gag gtc gee cet tgg tec aag 48 
Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
1 5 10-15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gcc cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 '40 45 

( 

get gac gcc tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gcc aag gtc tgg ggc aag acc ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tge aag gee get att gag get gcc cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tge gtc ttc gtg gee aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gee ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc acc aag gcc aag teg gtg ctg get ate cac aac ate gcc ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gcc 57 6 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 



gcc ttt gac aag ctg gcc ttc teg gac ggc tat gee aag gtt tac act 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 



624 
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gag gcc acc 
Glu Ala Thr 
210 

acc tac aag 
Thr Tyr Lys 

225 

aag ctg gtg 
Lys Leu Val 



gcc gcc ggc 
Ala Ala Gly 



gag ggc att 
Glu Gly He 
275 

gac aag ttc 
Asp Lys Phe 
290 

aag gcc gcc 
Lys Ala Ala 
305 

gac ccc acc 
Asp Pro Thr 



aag ggt gtg 
Lys Gly Val 



ccc aag gtg 
Pro Lys Val 
355 

aag ctg gtg 
Lys Leu Val 
370 

gtg gtc aag 
Val Val Lys 
385 

gac ttc atg 
Asp Phe Met 



ctg cac gcc 
Leu His Ala 



ggc ctg gtc 
Gly Leu Val 
435 



ccc atg gag 
Pro Met Glu 



aag ate aac 
Lys He Asn 
230 

act gtg teg 
Thr Val Ser 
245 

ggt gtg gag 
Gly Val Glu 
260 

gtg aac ggc 
Val Asn Gly 



ctg tct gcg 
Leu Ser Ala 



gcc aag gag 
Ala Lys Glu 
310 

gcc ccc ctg 
Ala Pro Leu 
325 

gac ate ate 
Asp He He 
340 

cag ate gee 
Gin He Ala 



aac gee ate 
Asn Ala He 



ttc teg gcg 
Phe Ser Ala 
390 

ctg gtg ccc 
Leu Val Pro 
405 

atg cac tac 
Met His Tyr 
420 

gac acc gtc 
Asp Thr Val 



gag gac gag 
Glu Asp Glu 
215 

tgg ctg aag 
Trp Leu Lys 



ccc aac tac 
Pro Asn Tyr 



ctg gac acc 
Leu Asp Thr 
265 

atg gac att 
Met Asp He 
280 

ccc tac. gac 
Pro Tyr Asp 
295 

gcc ctg cag 
Ala Leu Gin 



ttc gcc ttc 
Phe Ala Phe 



ctg gcc gcc 
Leu Ala Ala 
345 

ate ctg ggt 
He Leu Gly 
360 

ggc ace aag 
Gly Thr Lys 
375 

ccc ctg gcg 
Pro Leu Ala 



teg cgc ttc 
Ser Arg Phe 



ggt acc gtg 
Gly Thr Val 
425 

aag gag ggc 
Lys Glu Gly 
4 40 



aag ccc ccg 
Lys Pro Pro 
220 

ggt ggc att 
Gly Gly He 
235 

gcg acc gag 
Ala Thr Glu 
250 

gtc ate cgc 
Val He Arg 



gag gag tgg 
Glu Glu Trp 



cag aac age 
Gin Asn Ser 
300 

gcc gag ctg 
Ala Glu Leu 
315 

ate ggc cgc 
He Gly Arg 
330 

ctg ccc aag 
Leu Pro Lys 



acc ggc aag 
Thr Gly Lys 



tac aag ggc 
Tyr Lys Gly 
380 

cac atg etc 
His Met Leu 
395 

gag ccc tgc 
Glu Pro Cys 
410 

ccc gtg gta 
Pro Val Val 



gtc acc ggc 
Val Thr Gly 



ctg acg gga 
Leu Thr Gly 



ate gcc gcc 
He Ala Ala 



ate get gee 
He Ala Ala 
255 

gee aag ggc 
Ala Lys Gly 
270 

aac ccc aag 
Asn Pro Lys 
285 

gtc tac gee 
Val Tyr Ala 



ggc ctg cct 
Gly Leu Pro 



ctg gag gag 
Leu Glu Glu 
335 

ate ctg gee 
He Leu Ala 
350 

gcc gcc tac 
Ala Ala Tyr 
365 

cgc gcc aag 
Arg Ala Lys 



acc gcc ggc 
Thr Ala Gly 



ggc ctg ate 
Gly Leu He 
415 

gee tec aee 
Ala Ser Thr 
430 

ttc cae atg 
Phe His Met 
445 



aag 672 
Lys 



gac 720 

Asp 

240 

gat 7 68 

Asp 



att 816 
He 



ace 864 
Thr 



ggc • 912 
Gly 



gtg 960 

Val 

320 

eag 1008 
Gin 



acc 1056 
Thr 



gag 1104 
Glu 



ggc 1152 
Gly 



gcc 1200 

Ala 

400 

cag 1248 
Gin 



ggc 1296 
Gly 



ggc 1344 
Gly 
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1440 



1488 



gcc ctg aac gcc gac aag ctg gac gag get gac gcc gac gcc ctg gcc 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tee tgg tee aag ecc 
Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu He Lys Val Pro Val Ala Glu 
515 520 525 

\ 

aag ate ccc 1593 
Lys He Pro 
530 



<210> 9 

<211> 531 

<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of the complete eDNA coding for 
the GBSSI of Chlamydomonas reinhardtii 

<400> 9 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 



Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 
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Ala Leu Val Pro 
145 

Phe Thr Lys Ala 



Val Leu Leu Lys 
150 

Lys Ser Val Leu 
165 



Asp Glu Tyr Gin 

155 

Ala lie His Asn 
170 



Pro Lys Gly Gin 
160 

He Ala Phe Gin 

175 



Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 



Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 

225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 

245 250 255 



Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 

275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 



Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 



Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 
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Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

Glu Met Val Ala Asn Cys He Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

Gly Val Ala Thr Ala Lys Lys Glu Glu He Lys Val Pro Val Ala Glu 
515 520 525 



Lys He Pro 
530 



<210> 10 
<211> 23 
<212> PRT 

<213> artificial sequence 
<220> 

<22 3> fragment of GBSSI of Chlamydomonas reinhardtii 
<400> 1 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Gly Gly Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val 
20 



<210> 2 
<211> 23 
<212> PRT 

<213> artificial sequence 
<220> 

<223> fragment of GBSSI of Chlamydomonas reinhardtii 
<400> 2 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val 
20 



